46 research outputs found

    Pruning artificial neural networks: a way to find well-generalizing, high-entropy sharp minima

    Full text link
    Recently, a race towards the simplification of deep networks has begun, showing that it is effectively possible to reduce the size of these models with minimal or no performance loss. However, there is a general lack of understanding of why these pruning strategies are effective. In this work, we compare and analyze solutions obtained with two different pruning approaches, one-shot and gradual, showing the higher effectiveness of the latter. In particular, we find that gradual pruning allows access to narrow, well-generalizing minima, which are typically missed by one-shot approaches. We also propose PSP-entropy, a measure of how strongly a given neuron correlates with specific learned classes. Interestingly, we observe that the features extracted by iteratively-pruned models are less correlated to specific classes, potentially making these models a better fit for transfer learning.
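
    The abstract contrasts one-shot and gradual (iterative) pruning. Below is a minimal numpy sketch of the two schedules using simple magnitude pruning; the function names and the `retrain` callback are illustrative assumptions rather than the paper's actual procedure, and a real pipeline would operate on a full model rather than a single weight array.

```python
import numpy as np

def magnitude_mask(weights, sparsity):
    """Binary mask that zeroes the smallest-magnitude fraction of weights."""
    k = int(sparsity * weights.size)
    if k == 0:
        return np.ones_like(weights)
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    return (np.abs(weights) > threshold).astype(weights.dtype)

def one_shot_prune(weights, sparsity):
    """Prune to the target sparsity in a single step."""
    return weights * magnitude_mask(weights, sparsity)

def gradual_prune(weights, sparsity, steps, retrain):
    """Raise the sparsity over several prune/fine-tune cycles."""
    for step in range(1, steps + 1):
        current = sparsity * step / steps
        weights = weights * magnitude_mask(weights, current)
        weights = retrain(weights)  # fine-tune between pruning steps
    return weights
```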

    From Statistical Physics to Algorithms in Deep Neural Systems

    Get PDF
    The abstract is in the attachment.

    Learning Sparse Neural Networks via Sensitivity-Driven Regularization

    Full text link
    The ever-increasing number of parameters in deep neural networks poses challenges for memory-limited applications. Regularize-and-prune methods aim at meeting these challenges by sparsifying the network weights. In this context, we quantify the output sensitivity to the parameters (i.e. their relevance to the network output) and introduce a regularization term that gradually lowers the absolute value of parameters with low sensitivity. Thus, a very large fraction of the parameters approach zero and are eventually set to zero by simple thresholding. Our method surpasses most recent techniques in terms of both sparsity and error rate; in some cases, it reaches twice the sparsity obtained by other techniques at equal error rates.
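
    As a rough illustration of the regularize-and-prune idea described above, the sketch below pulls low-sensitivity parameters toward zero and then thresholds them. The normalization and the update rule are assumptions for illustration; the paper derives the sensitivity term precisely from the network output.

```python
import numpy as np

def sensitivity_shrink(w, grad_output, lr=0.1, lam=1e-4):
    """Shrink each weight in inverse proportion to its output sensitivity."""
    s = np.abs(grad_output)        # per-parameter sensitivity proxy
    s = s / (s.max() + 1e-12)      # normalize to [0, 1] (illustrative)
    return w - lr * lam * (1.0 - s) * np.sign(w)

def threshold_prune(w, tau=1e-3):
    """Set near-zero weights exactly to zero."""
    return np.where(np.abs(w) < tau, 0.0, w)
```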

    Can we avoid Double Descent in Deep Neural Networks?

    Full text link
    Finding the optimal size of deep learning models is highly topical and of broad impact, especially in energy-saving schemes. Very recently, an unexpected phenomenon, the "double descent", has caught the attention of the deep learning community: as the model's size grows, the performance first degrades and then goes back to improving. This raises serious questions about the optimal model size for high generalization: the model needs to be sufficiently over-parametrized, but adding too many parameters wastes training resources. Is it possible to find the best trade-off efficiently? Our work shows that the double descent phenomenon is potentially avoidable with proper conditioning of the learning problem, but a final answer is yet to be found. We empirically observe that there is hope to dodge the double descent in complex scenarios with proper regularization, as a simple ℓ2 regularization already contributes positively to such a perspective.
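
    For reference, the ℓ2 regularization mentioned above amounts to adding a quadratic penalty to the loss, which appears as a weight-decay term in the gradient step. A minimal sketch with plain SGD and illustrative hyperparameters:

```python
import numpy as np

def sgd_step_l2(w, grad_loss, lr=0.1, lam=1e-4):
    """One SGD step on loss + (lam / 2) * ||w||^2.

    The lam * w term is the gradient of the L2 penalty (weight decay).
    """
    return w - lr * (grad_loss + lam * w)
```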

    EnD: Entangling and Disentangling deep representations for bias correction

    Get PDF
    Artificial neural networks achieve state-of-the-art performance on an ever-growing variety of tasks. However, problems such as the presence of biases in the training data call into question the generalization capability of these models. In this work we propose EnD, a regularization strategy whose aim is to prevent deep models from learning unwanted biases. In particular, we insert an "information bottleneck" at a certain point of the deep neural network, where we disentangle the information about the bias while still letting the information useful for the training task propagate forward through the rest of the model. One big advantage of EnD is that it does not require additional training complexity (like decoders or extra layers), since it is a regularizer applied directly to the model being trained. Our experiments show that EnD effectively improves generalization on unbiased test sets, and that it can be applied in real-world scenarios, such as removing hidden biases in COVID-19 detection from radiographic images.
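
    A hedged sketch of what an EnD-style penalty on a batch of bottleneck representations could look like: similarities between samples sharing a bias label are suppressed (disentangling), while samples of the same class carrying different bias labels are pulled together (entangling). The exact formulation, weights, and names here are assumptions, not the paper's definition.

```python
import numpy as np

def end_style_penalty(features, targets, bias_labels, alpha=1.0, beta=1.0):
    """Illustrative EnD-style regularizer on a batch of representations."""
    # Cosine similarities between L2-normalized feature vectors.
    f = features / (np.linalg.norm(features, axis=1, keepdims=True) + 1e-12)
    gram = f @ f.T
    same_bias = bias_labels[:, None] == bias_labels[None, :]
    same_class = targets[:, None] == targets[None, :]
    off_diag = ~np.eye(len(f), dtype=bool)

    # Disentangle: decorrelate samples that share the same bias.
    sb = gram[same_bias & off_diag]
    disentangle = np.abs(sb).mean() if sb.size else 0.0

    # Entangle: align same-class samples that carry different biases.
    sc = gram[same_class & ~same_bias]
    entangle = (1.0 - sc.mean()) if sc.size else 0.0

    return alpha * disentangle + beta * entangle
```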

    On the Role of Structured Pruning for Neural Network Compression

    Get PDF

    LOss-Based SensiTivity rEgulaRization: towards deep sparse neural networks

    Full text link
    LOBSTER (LOss-Based SensiTivity rEgulaRization) is a method for training neural networks with a sparse topology. Let the sensitivity of a network parameter be the variation of the loss function with respect to a variation of that parameter. Parameters with low sensitivity, i.e. having little impact on the loss when perturbed, are shrunk and then pruned to sparsify the network. Our method allows training a network from scratch, i.e. without preliminary learning or rewinding. Experiments on multiple architectures and datasets show competitive compression ratios with minimal computational overhead.
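
    A minimal sketch of the loss-based sensitivity idea, assuming sensitivity is approximated by |∂L/∂w| and normalized per step: weights whose perturbation barely affects the loss receive the strongest shrinkage. The normalization and hyperparameters are illustrative, not the paper's exact update rule.

```python
import numpy as np

def lobster_style_step(w, grad_loss, lr=0.1, lam=1e-4):
    """Gradient step plus shrinkage of low-sensitivity weights."""
    s = np.abs(grad_loss)              # sensitivity proxy: |dL/dw|
    s = s / (s.max() + 1e-12)          # normalize to [0, 1] (illustrative)
    shrink = lam * (1.0 - s) * w       # insensitive weights shrink hardest
    return w - lr * grad_loss - shrink
```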